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Title of the Invention: 

Method and Apparatus For Performing 
Multiple Simultaneous Manipulations Of 
Biomolecules In A Two-Dimensional Array 

5 Field of the Invention: 

This invention relates to methods and apparati for performing multiple 
simultaneous manipulations of biomolecules in a two-dimensional array, such as a 
gel, membrane, tissue biopsy, etc. Such manipulations particularly include assays 
and nucleic acid amplification protocols. 

1 0 Statement of Governmental Interest 

This invention was fiinded by the Intramural Research Program at the 
National Institutes of Health. The United States Government has certain rights to 
this invention. 

Background of the Invention: . 

1 5 An emerging approach to the investigation of disease and cellular state 

involves the analysis of the complete set of RNA molecules - the "transcriptome" 
- expressed by a cell or tissue sample (see, Devaux, F. (2001) "Transcriptomes, 
Transcription Activators And Microarrays," FEBS Lett. 498:140-4; see 
also, United States Patents Nos. 6,221,600; 6,303,308; 6,461,814; European Patent 

20 Publications Nos. EP 0970202a2, 1 174521a3, 1 190044a2; Japanese Patent 

Application JP 20021 42765a2, and PCT International Patent Publications Nos. 
WO 0077214a3; WO 0138577a3; WO 02068466a2; WO 02I0449a2; WO 
0246465a2 and WO 9832847a2). 

High-throughput gene expression array and proteomic technologies make 
25 possible the simultaneous analysis of thousands of mRNA transcripts and proteins, 
allowing a global view of the molecular events associated with normal cellular 
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processes and disease states (Schena, M. et al. (1995) "Quantitative 

MONITORING OF GENE EXPRESSION PATTERNS WlTH A COMPLEMENTARY DNA 

MlCROARRAY » Science 270: 467469; Schena, M. et al (1 998) "Microarrays: 

BIOTECHNOLOGY'S DISCOVERY PLATFORM FOR FUNCTIONAL GENOMICS," Trends 
BiotechnoL 16: 301-306; x DeRisi, J. etal. (1996) "USE Of A cDNA MlCROARRAY 
TO ANALYSE GENE EXPRESSION PATTERNS IN HUMAN CANCER," Nat Genet 14: 

457-460; Chee, M.etal. (1996) "Accessing Genetic Information With High- 
Density DNA ARRAYS. Science 274: 610-614; Lander, E. (1999) "Array Of 
Hope." Nat. Genet. 21: 3-4; Emmert-Buck, MR. et al. (2000) "A STRATEGIC 
Approach For Proteomic Analysis Of Human Tumors," Mol. Carcin. 27: 1 -8; 
Emmert-Buck, M.R. etal. (2000) "MOLECULAR PROFILING OF CLINICAL TISSUE 
SPECIMENS: FEASIBILITY AND APPLICATIONS," Am. J Pathol. 156: 1 1 09-11 15; 

Celis, J. etal. (2000) "Gene Expression Profiling: Monitoring Transcription 
And Translation Products Using DNA Micro Arrays And Proteomics," 
FEBS Lett. 480: 2-16; Anderson, N.L. et al. (1998) "PROTEOME And Proteomics: 
New Technologies, New Concepts, And New Words," Electrophoresis 19: 
1 853- 1 861 ; Duggan, D.J. et al. (1999) "EXPRESSION PROFILING USING CDNA 
MICROARRAYS," Nat. Genet 21:10-14; Khan, J. et al. (1 999) "EXPRESSION 
Profiling In Cancer Using cDNA Microarrays," Electrophoresis 20: 223-229; 
Lipshutz, RJ. et al. (l 999) "High Density Synthetic Oligonucleotide 
Arrays," Nat. Genet. 21: 20-24; Lockhart, D.I. etal. (1996) "EXPRESSION 
MONITORING BY HYBRIDIZATION TO HIGH-DENSITY OLIGONUCLEOTIDE ARRAYS," 

Nat. BiotechnoL 14: 1675-1680; Velculescu, V. et al. (1995) "Serial Analysis 
Of GENE EXPRESSION," Science 270: 484-487; Liotta, L. et al. (2000) 
"Molecular Profiling Of Human Cancer," Nature Reviews Genetics 1 : 48- 
56). 

The emergence of transcriptome analysis has, however, been encumbered 
by the limitations of existing methodologies. Typically, such technologies identify 
a subset of genes (from a few dozen to several hundred) whose expression profile 
provides novel insight into cellular physiology and/or allows disease states to be 
segregated on a molecular rather than a phenotypic basis (Perou, C. et al. (2000) 
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"MOLECULAR PORTRAITS OF HUMAN BREAST TUMOURS," Nature 406: 747-752; 
Alizadeh, A.A. et al. (2000) "Distinct Types Of Diffuse Large B-Cell 
Lymphoma Identified By Gene Expression Profiling," Nature 403: 503-51 1; 
Dhanasekaran, S. et al. (2001) "DELINEATION OF PROGNOSTIC Biomarkers In ' 
5 PROSTATE Cancer," Nature 412: 822-826; Hedenfalk, I. et al. (2001) "Gene- 
EXPRESSION PROFILES IN HEREDITARY BREAST CANCER," N Engl. J. Med. 344: 
539-548; Golub, T.R. el al. (1999) "MOLECULAR CLASSIFICATION Of Cancer: 
Class Discovery And Class Prediction By Gene Expression Monitoring. 
Science 286:53 1-537; Klose, J. (1999) "Genotypes And Phenotypes," 

10 Electrophoresis 20: 643-652; Strausberg, R.L. et al. (2000) "The Cancer Genome 
Anatomy Project: Building An Annotated Gene Index," Trends Genet 
1 6: 1 03- 1 06; Zhang, L. et al (l 997) "Gene Expression Profiles In Normal And 
Cancer CELLS," Science 276: 1268-1272). Although these studies provide 
valuable information, it is desirable to independently confirm and quantitatively 

1 5 measure the expression level of each of the genes of interest Prior to the advent of 
the present invention, this represented a significant challenge in terms of time and 
effort. Moreover, the amount of biological sample available for subsequent 
investigation is often limiting, particularly in the case of developmental biology 
samples and clinical specimens. 

20 Although transcriptome analysis can be conducted by performing multiple 

Northern blots (Aldaz, CM. et al., (2002) "Serial Analysis Of Gene 
Expression In Normal p53 Null Mammary Epithelium," Oncogene 2 1 :6366- 
6376), this approach can be laborious and time-consuming (see, Su, A.I. et al. 
(2002) "Large-Scale Analysis of The Human And Mouse Transcriptomes," 

25 Proc. Natl. Acad. Sci USA. 99:4465-70). More fundamentally, such an analysis 
is inherently biased against low abundance transcripts. 

Various protocols have likewise been developed to generate cDNA libraries 
from globally amplified RNA of single cells (Belyavsky, A et al. (1 989) "PCR- 
Based cDNA Library Construction: General cDNA Libraries At The 
30 Level Of A Few Cells," Nucl. Acids Res. 17:2919-2932; Brady, G.et al. (.1993) 
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"CONSTRUCTION OF cDNA Libraries From Single Cells," Meth. Enzymol. 
225:61 1-623; Karrer, E.E. et aL (1995) "In situ isolation of mRNA from 

INDIVIDUAL PLANT CELLS: CREATION OF CELL-SPECIFIC CDNA LIBRARIES," Proc. 

Natl. Acad. Sci. USA 92:3814-3818), and cDNA microarrays have been used to 
5 analyze gene expression patterns (DeRisi, J. et al (1 996) "Use Of A cDNA 
MlCROARRA Y TO ANALYSE GENE EXPRESSION PATTERNS IN HUMAN CANCER" 
Nature Genetics 74:457-60; Li, S. et al (2001) "COMPARATIVE GENOME-SCALE 
ANALYSIS OF GENE EXPRESSION PROFILES IN T CELL LYMPHOMA CELLS DURING 
MALIGNANT PROGRESSION USING A COMPLEMENTARY DN A MlCROARRAY," Amer. 
10 J. Pathol. 158:1231-1237; Saha, S. et al (2002) "USING THE TRANSCRIPTOME TO 
ANNOTATE the GENOME," Nat Biotechnol. 20:508-512; Bpno, H. etal. (2002) 
"FUNCTIONAL TRANSCRIPTOMES: COMPARATIVE ANALYSIS OF BIOLOGICAL 
PATHWAYS AND PROCESSES IN EUKARYOTES TO INFER GENETIC NETWORKS AMONG 
TRANSCRIPTS," Curr Opin Struct Biol. 12:355-361; Schena, M. et al (1995) 
1 5 "QUANTITATIVE MONITORING OF GENE EXPRESSION PATTERNS WlTH A 

COMPLEMENTARY DNA MlCROARRAY" Science 270:467-70; Anisimov, S. et al 
(2002) "A Quantitative and Validated SAGE Transcriptome Reference 
for Adult Mouse Heart," Genomics 80:21 3-222). 

Unfortunately, however, all such protocols have drawbacks, including the 
20 selective amplification of the 3* ends of a transcript, insufficient sensitivity in 

amplification (Klein, C.A. etal (2002) "Combined transcriptome and genome 
ANALYSIS OF SINGLE MICROMETASTATIC CELLS," Nat. Biotechnol. 20:387-92) and 
the problem of distinguishing critical transcript species from merely abundant 
transcripts (Curtis, R.K. etal (2002) "CONTROL ANALYSIS OF DNA MlCROARRAY 
25 EXPRESSION data," Molec. Biol. Rep. 29:67-71). While microarrays permit one to 
compare transcriptomes of different cells and tissues, they do not retain 
information concerning the architecture or location of the detected transcripts. 
Techniques of in situ hybridization and amplification have been developed to 
permit the localization of RNA transcripts, however such techniques focus on one 
30 or a small number of genes and do not assess the expression of the transcriptome 
(see, e.g., Nuovo, GJ. (2001) "Co-labeling using insituPCR: A review," J. 
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Histochem. Cytochem. 49:1329-1339; Moore, J.G. etal (2000) "HER-2/Neu 
Gene Amplification In Breast Imprint Cytology Analyzed By 
Fluorescence In Situ Hybridization: Direct Comparison With Companion 
Tissue Sections," Diagn. Cytopathol. 23:299-302; Seeds, M.C. et al (2000) 
5 "Cell-Specific Expression Of Group X And Group V Secretory 

Phospholipases A(2) In Human Lung Airway Epithelial Cells," Amer. J, 
Respir. Cell. Mol. Biol. 23:37-44). 

Various approaches have been attempted to "capture" the 2-dimensional 
positional relationship between molecules of a sampled array. A paraffin block has 

1 0 been described (http^/cmag.cit.nih.gov/Tissuearrav.htm^ in which multiple cores 
(50-500) of tissue are placed in an organized grid. The device is a said to be 
amenable for use in a variety of experiments, including immunohistochemistry, 
immunofluorescence, FISH, in situ hybridization, and to provide a high throughput 
platform for tissue, in which hundreds of samples can be analyzed at one time, and 

1 5 multiple experiments can be performed on the same array (see, 

http://www.laboreLno/Acrobat/Biogenex/Biolink%20VOL.Ddfi . Microwell and 
microtiter plates (e.g., Thermo Labsysteras 384- Well Solid Microtiter Plate) are 
example of 2-dimensional arrays of partitioned grids or chambers. 

The problem of detecting high relevance, low abundance, transcripts is of 
20 particular significance in the analysis of complex tissue samples. Advanced 

technologies, such as the "Gene Chip" are reportedly able to detect no more than 
30% of the transcripts present in complex tissue samples (Evans, S J. et al (2002) 

"Evaluation of Affymetrjx Gene Chip Sensitivity In Rat Hippocampal 

i 

Tissue Using SAGE Analysis. Serial Analysis of Gene Expression," Eur. J. 

25 Neurosci. 1 6:409-1 3; Piper, MD.W. et al (2002) "Reproducibility Of 
Oligonucleotide Microarray Transcriptome Analyses: An 
Interlaboratory Comparison Using Chemostat Cultures Of 
Saccharomyces CEREVisiAE," J. Biol. Chem. 277:37001-37008). Where relevant 
cells (i.e., those associated with the production of high relevance, low abundance, 

30 Vanscripts) can be identified, techniques such as microdissection or laser-capture 
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microscopy may be employed (Emmert-Buck, M.R. et al (1996) "Laser CAPTURE 
Microdissection," Science 274:998-1001; Bonner, R.F. et al (1997) "Laser 
Capture Microdissection: Molecular Analysis Of Tissue," Science 
278:1481-1483), however, in many cases such relevant cells have not been 
5 identified, or cannot be detected. 

In particular, there is an important need for new technologies that facilitate 
follow-up analysis of array- and proteomic-derived data. Although many such 
approaches are under development each has its particular weaknesses (see, for 
example, Kononen, J. et al. (1998) "Tissue Micro Arrays ForHigh- 

1 0 Throughput Molecular Profiling Of Tumor Specimens," Nat Med. 4: 844- 
847; Berndt, P. et al (1999) "RELIABLE AUTOMATIC PROTEIN IDENTIFICATION 
From Matrix-Assisted Laser Desorption/Ionization Mass Spectrometric 
Peptide Fingerprints," Electrophoresis 20: 3521-3526; Binz, PA. etal. (1999) 
"A Molecular Scanner To Automate Proteomic research and To Display 

15 Proteome IMAGES," AnaLChem. 71 : 4981-4988; Bubendorf, L. et al. (1999) 
"Survey Of Gene Amplifications During Prostate Cancer Progression By 

HlGH-THROUGHOUT FLUORESCENCE IN SlTU HYBRIDIZATION ON TISSUE 
MlCROARRAYS," Cancer Res. 59: 803-806; Celis, J.E. etal. (1999) "2D PROTEIN 
Electrophoresis: Can It Be Perfected?" Curr. Opin. Biotechnol. 10:16-21; 

20 Humphery-Smith, I. (1 998) "Proteomics: From Small Genes To High- 
Throughput Robotics," J. Protein Chem. 17:524-525; Herbert, B. (1999) 
"Advances In Protein Solubilization For Two-Dimensional 
Electrophoresis," Electrophoresis 20: 660-663; Liu. Y. et al. (1999) "AcnvrrY- 
Based Protein Profiling: The Serine Hydrolases," Proc. Natl. Acad. Sci. 

25 USA. 96: 14694-14699; Lueking, A. et al. (1999) "Protein Micro Arrays For 
Gene Expression And Antibody Screening," Anal. Biochem. 270: 103-1 1 1; 
Quadroni, M. et al (1999) "Proteomics And Automation," Electrophoresis 20: 
664-677; Yates, J.R., 3 rd (2000) "MASS SPECTROMETRY: FROM GENOMICS To 
Proteomics," Trends Genet. 16: 5-8; Sidransky, D. (2000) "Emerging 

30 Molecular Markers Of Cancer," Nature Reviews Cancer 2:210-21 9). 



«0**-S875*fr » JL JL-SSS3 



Thus, despite all such advances, the development of a global amplification 
system remains 'the most critical hurdle" to transcriptome analysis. (Klein, C.A. et 

al. (2002) "COMBINED TRANSCRIPTOME AND GENOME ANALYSIS OF SINGLE 

Micrometastatic Cells," Nat. Biotechnol. 20:387-92). A need thus remains for 
an apparatus and method that would permit multiple, preferably simultaneous, 
manipulations of the biomolecules present in a two-dimensional array, such as a 
gel, or other solid support. The present invention is directed to this and other 
needs. 

Summary of the Invention: 

This invention relates to methods and apparati for performing multiple 
simultaneous manipulations of biomolecules in a two-dimensional array, such as a 
gel, membrane, tissue biopsy, etc. Such manipulations particularly include assays 
and nucleic acid amplification protocols. 

In detail, the invention concerns a method for analyzing the transcriptome 
of a cellular sample comprising analyzing two or more molecular species present 
in a 2-dimensional array of the cellular sample, wherein the method comprises 
treating the 2-dimensional array with an External Movement Inhibitor device 
having multiple discrete partitions, so as to sequester molecules present in the array 
into one or more discrete regions, wherein the treatment preserves the positional 
relationship of the molecules of the 2-dimensional array, and permits a 
determination of the locations) in the cellular sample in which the molecular 
species are present. 

The invention further concerns the embodiment of such method wherein 
the cellular sample is a cellular sample obtained from a mammal (especially 
wherein the mammal is a human). 



The invention further concerns the embodiment of such methods wherein 
the cellular sample is a tissue sample (especially a biopsy). 
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The invention particularly concerns the embodiment of such methods 
wherein the molecular species are nucleic acid molecules. The invention further 
concerns the embodiment of such method wherein the method additionally 
comprises incubating the sequestered nucleic acid molecules of two or more 
5 regions under conditions sufficient to permit the manipulation of one or more 
preselected nucleic acid molecules if present at the regions, while preserving the 
positional relationship of the molecules relative to other molecules of the 2- 
dimensional array. The invention further concerns the embodiment of such 
method wherein the method comprises incubating the sequestered nucleic acid 

10 molecules of all of the regions under conditions Sufficient to permit the 

manipulation of the one or more preselected nucleic acid molecules. The invention 
further concerns the embodiment of such methods wherein the method additionally 
comprises transferring the manipulated nucleic acid species to two or more 
membranes, the membranes being differentially treated to enable the determination 

15 of the location(s) of manipulated nucleic acid speciesi The invention further f 
concerns the embodiment of such methods wherein the manipulation is selected 
from the group consisting of nucleic acid amplification, reverse transcription, 
labeling, cloning, and the assaying of a biomolecule. The invention further 
concerns the embodiment of such methods wherein one or more of the preselected 

20 nucleic acid molecule(s) are diagnostic of a disease state and/or wherein the 
nucleic acid molecules are amplified using a polymerase chain reaction. 

The invention further concerns the embodiment of such methods wherein 
the cellular sample is an extract of a cell, and the 2-dimensional array is a gel or 
membrane that arrays the nucleic acid molecules. 

25 The invention further concerns the embodiment of such methods wherein 

the method additionally comprises incubating the sequestered nucleic acid 
molecules of two or more regions under conditions sufficient to permit the 
amplification of one or more preselected nucleic acid molecules if present at the 
regions, while preserving the positional relationship of the molecules relative to 

30 other molecules of the 2-dimensional array. The invention additionally concerns 
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the embodiment of such methods wherein the method comprises incubating the 
sequestered nucleic acid molecules of all of the regions under conditions sufficient 
to permit the amplification of the one or more preselected nucleic acid molecules. 
The invention further concerns the embodiment of such methods wherein the 
method additionally comprises transferring the amplified nucleic acid species to 
two or more membranes, the membranes being differentially treated to enable the 
determination of the locations) of amplified nucleic acid species. The invention 
further concerns the embodiment of such methods wherein the manipulation is 
selected from the group consisting of nucleic acid amplification, reverse 
transcription, labeling, cloning, and the assaying of a biomoiecule. The invention 
further concerns the embodiment of such methods wherein one or more of the 
preselected nucleic acid molecuIe(s) are diagnostic of a disease state and/or 
wherein the nucleic acid molecules are amplified using a polymerase chain 
reaction. 

The invention particularly concerns the embodiment of such methods 
wherein the molecular species are protein molecules. The invention further 
concerns the embodiment of such method wherein the cellular sample is an extract 
of a cell, and the 2-dimensional array is a gel or membrane that arrays the 
molecules of the extract. The invention further concerns the embodiment of such 
method wherein one or more of the protein molecule(s) are diagnostic of a disease 
state. 

Brief Description of the Figures: 

Figure 1 is a schematic illustration of a preferred embodiment of the 
invention, and its use in layered expression scanning (LES). Panel A.: A 
biological sample is placed adjacent to a multilayered set of membranes. Panel B.: 
The specimen is transferred through the layers as an intact two-dimensional object 
For the analysis step, LES can be used as either a closed or open system. In the 
open version, the membranes bind a subset of the target molecules, resulting in a . 
series of •replicate* membranes that can be subsequently probed for specific genes 
or proteins. In the closed version, desired molecules (e.g., specific antibodies or 
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DNA molecules, etc.) are placed on each membrane. The biological sample is pre- 
labeled and passed through the layers. Panel C: Desired molecules (e.g., proteins, 
mRNAs, etc.) are captured by their corresponding membrane and the expression 
level of each is measured. In this application, the membranes permit non-target 
5 molecules to pass through unimpeded; thus the closed system can be used with a 
large number (100 or more) of layers. Panel D.: Illustration of the use of LES in an 
automated or user-directed manner to allow comprehensive study of bio-samples in 
an automated fashion. An External Movement Inhibitor device (EMI) is 
employed to facilitate the amplification and/or other analysis of biomolecules 
1 0 arrayed on a gel, tissue sample, membrane, etc. LES software allows investigators 
to move seamlessly through the data sets, facilitating comparison of expression 
levels of multiple biomolecules in different cell population, regions, etc. 

Figure 2 illustrates an "open end" embodiment of the EMI apparatus of the 
present invention. 

1 5 Figure 3 illustrates a sampling needle embodiment of the EMI apparatus of 

the present invention. 

Figure 4 illustrates a closed end "well" embodiment of the EMI apparatus 
of the present invention. 

Description of the Preferred Embodiments: 

20 The invention concerns a method and apparatus for accomplishing and/or 

facilitating the analysis of multiple biomolecules arrayed in a two-dimensional 
(2D) array, such as a gel or other solid support. 

The invention is particularly suited for use in Layered Expression Scanning 
(LES). LES is a new technology co-developed by the National Institutes of Health 
25 and 20/20 GeneSystems, Inc. (http://www.2020gene.com/) (Englert, C.R. et al 
(200) "Layered Expression Scanning: Rapid Molecular Profiling Of 
TUMOR SAMPLES," Cancer Res. 60: 1 526-1 530). The method utilizes a layered 
array of membranes for molecular analysis and can be applied to a variety of life 
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science platforms, including tissue sections, cells in culture, electrophoresis gels, 
. multi-well plates, and tissue arrays (Figure 1). The technique is preferably 
performed by passing the sample through the series of membrane layers while 
maintaining two-dimensional architecture, thus permitting the concurrent 
5 measurement of different RNA transcripts or proteins in each of the individual 
sample elements (e.g., various cellular phenotypes in a tissue section, bands on a 
gel, individual wells of a microtiter plate). The method has a number of properties 
that increase its utility. It is conceptually simple, requires no moving parts, can be 
used as an open or closed format, and maintains target biomolecules at a high 
1 0 concentration during the analysis process to produce sensitive measurements. 

In preferred embodiments, the invention employs a separator, referred to 
herein as an External Movement Inhibitor device ("EMI") that can be imposed 
upon a two-dimensional solid array to isolate and sequester those biomolecules 
located at one region of the array from those biomolecules located at a different 
15 region of the array. Since the EMI sequesters the various molecular species, the 
architecture of the 2-dimensional pattern, and the spatial relationships of the 
molecules in the array are preserved. 

The biomolecules of relevance to the present invention may be nucleic acid 
molecules (RNA, or DNA), proteins (enzymes, immunoglobulins, receptors, 

20 receptor ligands, hormones, antigens, etc.), carbohydrates or lipids. By isolating 
and sequestering the biomolecules of the array into discrete regions, the invention 
permits the analysis of some or all of such regions, so as to permit the detection of 
classes of molecules (e.g., nucleic acid molecules, proteins, etc.), and/or specific 
molecules (e.g., DIM A / RNA molecules of interest (for example those associated 

25 with the presence of a pathogen, or a tumor marker or receptor, etc.), or specific 
proteins (enzymes, cellular factors, receptors and receptor ligands, tumor markers, 
etc.), carbohydrates, metals, reporter molecules, etc.) that may be present in such 
regions. 
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Such analysis may optionally be conducted multiple times to thereby 
permit the detection of molecules in multiple sequestered regions. In a preferred 
embodiment, for example, a plurality of regions, and more preferably, all regions, 
sequestered by the EMI will be simultaneously evaluated for the desired 
5 molecules. In a preferred embodiment of the invention, such analysis will be 

performed on a series of replicated samples, so as to permit multiple evaluations to 
be accomplished, each investigating the presence of different molecules and/or 
different classes of molecules for some, most, and more preferably, all of the 
sequestered regions. 

10 The same or different analyses can be conducted at different times in order 

to examine the change in concentration, state or nature of particular biomolecules 
over time at a particular sequestered region of the array, or at multiple sequestered 
regions. 

The two-dimensional arrays of relevance to the present invention may be 
15 any of a wide variety of solid supports. Such supports particularly include "1- 
dimensional" gels (especially polyacrylamide gels, agarose gels, etc.), such as 
those typically employed to assess molecular weight, and "2-dimensional" gels, 
such as those employed to simultaneously separate analytes based on molecular 
size and charge. Other preferred supports include membranes (such as paper, 
20 nitrocellulose, etc.). Nitrocellulose membranes sold by 20/20 GeneSystems, Inc. 
(Rockville, MD) are particularly preferred supports. The supports that may be 
used in accordance with the present invention also include tissue specimens, 
biopsies, etc. 

Preferably, the EMI devices of the present invention will have a substantial 
25 number of grids, or wells, so as to be able to sequester the molecules of the array 
into a substantial number of regions. The devices are preferably suitable for 
sequestering the molecules present in closely positioned regions of the array so as 
to allow for nucleic acid/protein manipulation (amplification, reverse transcription, 
labeling, cloning, assaying of biomolecules, etc.) while maintaining the two- 
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dimensional spatial orientation of the molecules of the array. Preferably, such 
number of grids, or wells will be greater than 50, more preferably, greater than 
100, still more preferably, greater than 300, or greater than 1000. Preferred EMI 
devices include 96, 384, and 1536 well microtiter dishes.. 

In one embodiment, the EMI may comprise a grid that is open on both top 
and bottom (Figure 2), and can be used as a "punch" to sequester the biomolecules 
present in different regions of the array, which can then be analyzed in situ, or 
removed from the array (as by aspiration, inoculation, etc.) and subsequently 
analyzed. In an alternative embodiment (Figure 3), the EMI may comprise a 
plurality of probes that may be hollow or solid, and that may be used to sample the 
biomolecules present in different regions of the array. In a preferred sub- 
embodiment of this embodiment of the invention, the probes or needles are first 
placed into contact with the array, and then contacted with a filter, gel, or other 2- 
dimensional surface that can be assayed for desired biomolecules. In a further, 
embodiment (Figure 4), the EMI may comprise a "partially closed" grid, such that 
when interposed into the array, the biomolecules of the array regions are captured 
into chambers or "wells." Conventional microtiter plates, for example may be 
employed for such use. 

Having now generally described the invention, the same will be more 
readily understood through reference to the following examples, which are 
provided by way of illustration, and are not intended to be limiting of the present 
invention, unless specified. 

Example 1 
EMI Devices 

In one example of the invention, RNA transcripts of a biological sampl 
analyzed via gel electrophoresis to separate the transcripts by size, and the gel is 
then placed adjacent to an EMI and the entire complement of RNA is transferred 
into the wells. The EMI's grid causes each transcript to migrate directly into a 
well at precisely its position in the gel. By placing the EMI into a thermocycler, 
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RT-PCR can be simultaneously performed on all of the transcripts (each localized 
in their own well), thereby significantly increasing the number of cDNAs 
corresponding to each mRNA. Significantly, such a process will lead to the 
amplification (and detection) of all transcripts, even those of low abundance. The 
5 EMI containing the amplified cDNAs may be analyzed by standard layered 
expression scanning (LES) analysis by transferring them to a stack of LES 
membranes. 

The net effect of using the EMI in this example is to dramatically increase 
the nucleic acid content such that more replicate LES membranes can be produced, 
10 and/or, more sensitive measurements can be made, including low abundance 

transcripts that are not easily measurable on a standard northern blot (i.e., without 
an amplification step). Similar experiments could be carried out for DNA sample 
using Southern blots. 

Example 2 

1 5 . Direct Cell Target Analysis (DCTA) 

A second application of the EMI devices of the present invention involves 
using an EMI device to facilitate Direct Cell Target Analysis (DCTA). In this 
method, a DCTA polymer is employed to target specific cells in a tissue section, 
and then all of the cellular material is transferred into an EMI. Since such transfer 

20 would maintain the section's 2-dimensional architecture, each well would contain 
the cellular proteins corresponding to specific regions/cells of the tissue section. 
The wells that contain the targeted cells would also contain the DCTA polymer 
(which could be attached to a labeling enzyme, e.g., lactoperoxidase, etc. to label 
proteins with I !25 ). The labeling reaction could then be carried out simultaneously 

25 in all of the wells of the EMI. Using the EMI device in this manner would 

facilitate the DCTA labeling reaction by providing a soluble environment, thereby 
permitting efficient and complete protein labeling to occur. Similar experiments 
could be carried out using reverse transcriptase (in place of lactoperoxidase) to 
produce labeled first-strand cDNA for microarray experiments. 
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Example 3 

Multiplex Analysis of RNA and Protein Gels 
Using Layered Expression Scanning 

To specifically facilitate multiplex analysis of RNA and protein 
5 electrophoresis gels using layered expression scanning as an open system, a 
unique, highly efficient hybridization membrane was developed that permits ten 
blots to be produced from a single gel. Each blot can be subsequently probed for 
individual transcripts or proteins using standard procedures. The method increases 
the throughput rate of northern and immuno blots and importantly, permits an 
1 0 increase in the number of molecular measurements that can be made per 

biosample. The described technique makes feasible quantitative determination of 
the expression level of a large set of genes and proteins, such as those typically 
identified in array and proteomic studies. To assess the technical capabilities of 
the new membrane system, die following performance parameters were evaluated: 
1 5 hybridization characteristics, signal sensitivity, and reproducibility relative to 
standard blots. 

Multiplex Northern Blots Hybridization and/or total RNA binding 
characteristics of the layered array membranes were assessed in several 
experiments as follows. Total RNA.(l5-30 \ig) from cell lines MDA-MB-453 

20 (Geneka Biotechnology Inc, Montreal Quebec), Jurkat (Geneka Biotechnology Inc, 
Montreal Quebec), HeLa (Ambion Inc, Austin TX), Osteosarcoma MG-63 
(Ambion Inc, Austin TX) was electrophoresed in a 1 %, denaturing agarose gel. 
After washing the gel twice for five minutes in DEPC-treated water, a standard 
northern blot capillary transfer was performed overnight, except that the ten-layer 

25 membrane system (20/20 GeneSystems) was substituted for a nitrocellulose 
membrane. The following setup was utilized (from bottom to top): 20X SSC 
transfer buffer, transfer paper (Gel Blot Paper, Schleicher & Schuell, Keene, NH), 
agarose gel, 10-layer membrane set (20/20 GeneSystems, Rockville, MD), one 
nitrocellulose membrane (Protran, Schleicher & Schuell, Keene, NH), 20-40 pieces 

30 of transfer paper (Gel Blot Paper, Schleicher & Schuell, Keene, NH), and a 
standardized weight (7 g/cm 2 ). After transfer, the membranes were UV - 
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crosslinked (1,200 mJ), and total RMA capture was assessed by SYBR DX DNA 
Blot Stain (Molecular Probes, Eugene, OR). For experiments to analyze specific 
gene levels, the membranes were pre-hybridized in 6X SSC, 0.5% SDS, 10 jig/ml 
salmon sperm, and 5X Denhardfs, at 55°C for 30 min. Twenty- five tQ 50 ng of 

5 each probe was random-prime labeled using the Rediprime 11 Kit (Amersham 
Pharmacia Biotech, Buckinghamshire, England) and 33p incorporation. The 
membranes were hybridized using 2,000-10,000 cpm/jil, at 55°C in a rotating tube 
overnight, and then washed two times for 1 0 minutes in IX SSC 0.5% SDS, and 
two times for 1 0 minutes in IX SSC, 0. 1 %SDS, and exposed using the 

10 Phosphorimager 445 SI (Microdinamic Engineering, Rockville MD). Quantitation 
of signals was performed using ImageQuant software (IQMac vl .2). A set of 
parallel experiments was also performed under identical conditions, but the stack 
often layered membranes was replaced with a conventional nitrocellulose 
membrane (Protran, Schleicher & Schuell, Keene, NH). Stripping of layered 

15 membranes was performed in a boiling solution of 0.5% SDS for 5-10 min, and 
membranes were subsequently re-hybridized. 

Multiplex Immuno Blots Generation of multiple protein blots was first 
assessed by evaluating total protein staining on ten blots. Protein extract was 
obtained by lysing Jurkat cells in PBS with 1 % SDS. The concentration of protein 

20 was determined by using the BCA Protein Assay Reagent Kit (Pierce, Rockford, 
IL). Twenty \ig and 40 jig of cellular protein extract, and 10 \ig of bovine serum 
albumin (ICN) was separated by polyacrylamide gel electrophoresis (PAGE) on a 
10 % TRIS/HC1 gel (BioRad, Hercules, CA). Transfer from the gel onto ten 
membranes (20/20 GeneSystems, Rockville, MD) was done using the BioMax 

25 MultiBlot Kit for Proteins (Kodak, Rochester, NY) according to the manufacturer's 
recommendation. After transfer, membranes were rinsed in double distilled water 
and stained with the FastBLUE Staining Kit (Chemicon, Temecula, CA). Stained 
membranes were digitized by scanning on UMAX Vista Scan (UMAX 
Technologies Inc, Dallas, TX) and the image was stored in TIFF format. 

30 Quantitation of staining intensity was performed by Kodak ID software (Kodak, 
Rochester, NY). 
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To evaluate total and activated levels of Raf and Erk proteins, cellular 
extracts were made in PBS with 1% SDS from the following cell lines: HaCat, 
Jurkat, HeLa, N1H3T3, SvV480, MCF7, 293T, and S49. Twenty ug of pro tein 
extract was loaded per lane. Samples were separated by PAGE on a 1 0% 
TR1S/HCI gel (BioRad, Hercules, CA) and transferred onto ten membranes (20/20 
Gene Systems, Rockville, MD) using the BioMax MultiBIot Kit for Proteins 
(Kodak, Rochester, NY) according to the manufacturer's recommendation. After 
transfer, membranes were rinsed in TBS buffer (50 mM TRIS pH 8.0, 150 mM 
NaCI, 0.01% Tween-20) and blocked in IX casein solution (Vector Laboratories, 
Burlingame, CA) for 1 5 min. Membranes were then incubated for 8 hours at 40»C 
in a 1 :500 dilution of anti-Raf antibody (Transduction Laboratories, Palo Alto, 
CA), or in a 1 :500 dilution of anti-phospho-Raf (Cell Signaling, Beverly, CA), or 
in a 1:1000 dilution of anti-Erk antibody (Santa Cruz Biotechnologies, Santa Cruz, 
CA), or in a 1 :500 dilution of anti-phospho Erk antibody (Cell Signaling, Beverly 
CA). After incubation, the membranes were washed in TBS buffer and incubated 
in a 1 :2000 dilution of appropriate secondary antibody conjugated to HRP 
(Amersham-Pharmacia, Piscataway, NJ). Proteins were visualized with ECL Plus 
reagent (Amersham-Pharmacia, Piscataway, NJ) and the images captured on 
Kodak Image Station CF440 (Kodak, Rochester, NY). After primary incubation 
with the antibody of interest, all of the membranes were incubated in a 1:1000 
dilution of anti-GAPDH antibody (Chemicon, Temecula, CA) to confirm equal 
loading and transfer from the gel. 

The membranes were found to perform similarly for both RNA and protein 
applications. The quantity and size distribution of RNA that is captured by each of 
the membranes in the system was determined by transferring 15 ug of total RNA 
from MDA-MB-453 cells through the layers. A typical rRNA pattern with both 
28S and 1 8S bands is observed, indicating that the low binding capacity of the 
membranes results in rapid saturation during the transfer process, thus permitting 
the majority of the sample to progress through and bind to subsequent membranes. 
The maximal variability was found to be in membrane #1, which showed a 32% 
increase over the average signal, and in membrane #4 which showed a 23% 
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decrease. This degree of alteration has minimal effect on subsequent probe 
hybridization results, and is within the normal range of experiment-to-experiment 
variability typically observed with standard northern blots. Similar results were 
obtained using protein gels. The difference in intensity signal for all 10 
5 membranes ranged from 12-22% from the average value. 

The hybridization characteristics of the system were evaluated by analyzing 
the signal generated by a GAPDH probe on each often membranes using 15 jig of 
total RNA from the osteosarcoma cell line MG-63. A specific 1 .3 kb band 
corresponding to the GAPDH transcript is seen in each membrane. Similar to 

10 overall RNA levels, only a relatively small variation in detected signal was 

observed between the membranes (ranging from a 41 % decrease from the average 
to a 29% increase). Taken together, the total RNA, protein and GAPDH 
quantitation data demonstrate that the layered array system generates ten 
membranes that reliably bind RNA and protein, and can be successfully probed for 

15 specific genes. 

In addition to its low capacity binding characteristics, the new membrane 
was designed to provide increased hybridization efficiency. Even though it binds 
significantly less RNA or protein than other membranes, the signal intensity after 
probing approaches that obtained in a traditional blot This feature significantly 

20 expands the utility of the system, as investigators can perform multiple northern or 
immuno blots from a single sample preparation, each with a high degree of 
sensitivity. To compare hybridization results between the layered array system and 
a standard blot, 1 5 |ig of total osteosarcoma MG-63 cell line RNA was analyzed by 
both LES and traditional methods in parallel. After the transfer of RNA out of the 

25 gels, all membranes were probed, washed, and imaged under identical conditions. 
Qualitatively, the results obtained were similar between the two blots in terms of 
specificity and hybridization background. Densitometric analysis indicated the 
GAPDH band on the new membrane had a signal equal to 90% of the 
nitrocellulose membrane. In general, the layered array membranes are observed to 

30 produce band intensities ranging from 60-95% of traditional blots for both mRNA 
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and proteins. The subset of membranes that show 40% less intensity have a 
minimal effect on the utility of the system; however, it is desirable in some 
experiments to expose the blots to autoradiography film for an extended period of 
time in order to produce band patterns that are identical to those seen on traditional 
5 blots. 

The ability to perform multiple expression measurements on the same 
sample preparation has significant utility for investigators. For example, different 
members of a signaling pathway can be simultaneously measured, including both 
total protein and the subset of activated protein. This aspect of the invention is 
illustrated by the following experiment. Total protein lysate from eight cell lines 
was separated by gel electrophoresis and ten blots were subsequently produced in a 
single transfer.- Membranes 3-6 were analyzed for total Raf protein, 
phosphorylated Raf, total Erk protein, and phosphorylated Erk, respectively. It is 
found that the ratio of total to activated protein varies among the cell lines, 
indicating that protein expression and activation status are independently regulated. 
Thus, measuring both forms of Raf and Erk proteins is important to an 
understanding of the overall kinetics and activation state of the pathway. 
Multiplex measurements such as these are likely to become increasingly important 
in the future as investigators move toward a systems-based understanding of 
biological processes. 

To assess the uniformity and reproducibility of the layered array system, 20 
lig of HeLa cell line total RNA was analyzed in triplicate using GAPDH, HPV-1 8 
E6/E7, PCNA, and cdc2 probes. The selected target genes are present in HeLa 
cells at varying levels of abundance, thus the membranes could be evaluated across 
25 a 20-fold range of expression. The signals for each lane were quantified and 

compared. The intra-membrane lane variability ranged from a 29% increase from 
the average to a 1 9% decrease, with a median variance of ± 9.3%. These results 
were then compared with the membrane variability of traditional northern blots 
using nitrocellulose membranes. Twenty \ig of HeLa cell line total RNA was run 
30 on three separate electrophoresis gels and subsequently blotted onto nitrocellulose. 
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The first blot was probed with HPV18 E6/E7, the second with PCNA, and the third 
with cdc2. The signal intensity was found to vary among the lanes from a 42% 
percent increase from the average to a 24% decrease, with a median variance of 
13.6%. 

5 The inter-membrane levels of total RNA staining on the three standard 

northern blots was also examined, and compared it with data from the layered 
array membranes. The inter-membrane total RNA levels among the three standard 
northern blots were found to vary among the lanes from a 42% percent increase 
from the average, to a 58% decrease. Therefore, based on both total RNA content 
1 0 and probe hybridization data, the layered membrane array system was found to 
perform similarly to standard northern blots in terms of intra- and inter-blot 
reproducibility. 

The ability of the layered array system to reliably detect relatively small 
differences (2-3 fold) in gene expression levels was evaluated, and compared the 

1 5 data with that from a traditional blot. Thirty fig of total RNA from Jurkat and 
MDA-MB-453 cell lines were separated on an agarose gel and transferred to a 
stack often membranes. PCNA probe was hybridized to membranes 2, 3, 5, 7, and 
9, and GAPDH probe was hybridized to membrane 4. Relative PCNA expression 
between the two cell types was calculated using GAPDH levels to normalize the 

20 amount of RNA loaded on the gel. Each of the five membranes probed for PCNA 
showed higher levels of expression in the Jurkat cells, ranging from a 1 .28- to a 
2.69-fold difference. To compare these results with standard northern blots, two 
30 \ig aliquots of Jurkat and MDA-MB-453 RNA were electrophoresed and 
subsequently transferred to two separate nitrocellulose membranes. Both blots 

25 were probed for PCNA and the band intensities were normalized using total RNA 
levels in the gel. Both blots showed higher levels of PCNA expression in Jurkat 
compared to MDA-MB-453 cells. Blot #1 showed a 1 .56 fold difference and blot 
#2 showed a 3.76 fold difference. Overall, the data indicate that both layered 
membrane arrays and standard northern blots can detect expression level changes 

30 at the 2-3 fold level, and show similar ranges of blot- to-blot variation. 



-21 - 

The experiment was designed to emulate standard experimental approaches 
utilized in the laboratory so that a realistic comparison between the two methods 
could be made. For the layered array system, one typically probes one membrane 
for a housekeeping gene to normalize gel loading, and then performs expression 
5 measurements of additional transcripts or proteins on the remaining layers. As a 
practical matter, this is an optimal use of the system and allows data to be 
generated quickly and efficiently. For standard northern blots, investigators often 
normalize gel loading using total RNA levels in the gel and this is the method 
employed in the experiment described above. Alternatively, one can 

1 0 simultaneously probe a blot against a gene of interest and a housekeeping gene (if 
they are of different sizes), or one can probe for the gene of interest, strip the blot, 
and re-probe for a housekeeping gene. The layered array system was compared 
with standard northern blots using each of these approaches. In each instance, the 
layered membrane system performed as well as standard northern blots in 

1 5 accurately measuring transcript levels. 

The utility of standard northern and immune blots can be increased by 
stripping and reprobing them. While this approach is useful, it has significant 
limitations. The stripping procedure for both protein and RNA blots is harsh, 
typically resulting in decreased hybridization levels and increased background each 

20 time the blot is probed. Anomalous results are occasionally observed after this 
procedure. Such results may be due to damage to the blot and/or, alterations in 
hybridization characteristics of a subset of target molecules. Nevertheless, the 
ability to re-probe blots can be useful in many experiments. Therefore, this 
procedure was evaluated for the new layered membrane system using actin and 

25 GAPDH probes (as model probes). Although this procedure can be applied 

successfully to the layered membranes, a decrease in membrane performance was 
observed after the stripping procedure, similar to that seen with traditional blots. 
For many experiments, its is found that the use of the membrane array system 
eliminates the need for re-probing of blots. 
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The layered membrane technique described herein offers several key 
advantages. It is easy to perform, relies on well-established probe and antibody 
hybridization methods, is an open system that permits investigators to analyze any 
gene or protein of interest, and provides information beyond simple expression 
5 levels such as transcript size(s) and protein processing status. A particularly 

important technical aspect of the method is that all of the blots are produced from a 
single sample, analyzed on one gel. This feature eliminates a number of potential 
experimental artifacts including: variance in sample preparation and solubility in 
loading buffer, problems associated with sample storage and/or freezing-thawing, 
1 0 and the typical variability seen among electrophoretic gel runs. 

Utilization of the multi-blot application of layered expression scanning can 
augment experimental strategies in the laboratory. For example, quantitative 
follow-up measurement of a large set of genes identified in cDNA micro array 
experiments can be performed on a set of samples in a rapid and relatively low- 
1 5 cost fashion. Moreover, investigators can quickly extend their study from a single 
transcript or protein of interest to related genes in a family and/or proteins involved 
in an associated biochemical pathway. Even when investigators do not initially 
need to perform ten separate measurements, it is advantageous to store the 
membranes for future experiments. 

20 The layered array system has been shown to be robust and reliable. The 

method permits investigators to produce usable blots from each RNA or protein 
gel, thus significantly increasing their utility. 

The EMI devices described of the invention may significantly extend the 
future capabilities of LES technology. As an example, EMI devices may be 
25 employed to add an amplification step (e.g., PCR, etc.) to the RNA gel or tissue 
application. The amplification is performed on the entire transcriptome, after the 
mRNA has been separated by gel electrophoresis, but prior to transfer through the 
membrane layers, using a method similar to in-shu PCR of a tissue section. This 
approach will increase the sensitivity level of each blot, as well as the number of 
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blots that can be created per gel. Moreover, PCR-based layered array membranes 
may permit multiplex, hybridization-based analysis of developmental biology or 
micro-dissected cell samples that typically do not produce enough RNA for 
standard blotting techniques. Thus, the LES layered membrane array methodology 
has immediate utility for multiplex mRNA and protein expression measurements, 
particularly in combination with an EMI device. 

Example 4 

Microarrays: Post-Analysis Follow-up and Validation 
Two important questions for investigators to consider when evaluating 
microarray expression data: are whetiier the results are valid (i.e., accurate) for the 
specific biological system under study, and whether the data fundamentally 
describe the phenomenon being investigated. 

Obtaining valid expression measurements may involve more than just post- 
array verification of results using an independent laboratory approach. 
Introduction of artifact is possible at any time during an array experiment, thus 
each component of the procedure is to be carefully considered. Overall, the 
validation process may be divided into three areas: experimental quality control, 
independent confirmation of data, and universality of results. 

Microarray Experiment Quality Control Optimizing array experiments 
at the front-end decreases the time and effort required to subsequently invalidate 
erroneous expression results. Briefly, the following parameters may significantly 
affect the accuracy of array experiments. Of significant importance in eliminating 
"noise" in the data are repeat experiments. Multiple arrays should be performed, 
including replicates of each RNA sample, as well as with independent RNA 
preparations. Additionally, "up-front validation" can be performed by including 
separate regions of each gene on the array. This allows for multiple measurements 
of expression as part of the initial experiment. An important step that affects the 
validity of the downstream data is image acquisition. There are several commercial 
software packages available, in addition to programs provided online at no cost to 
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investigators. Factors that can affect results include the methods for normalization 
and background subtraction, data processing and standardization, and use of 
visualization tools (Brazma, A. et al. (2001) "Minimum Information About A 
Micro Array Experiment (MIAME)-Toward Standards For Micro Array 
5 Data," Nat. Genet 29:365-371). 

Selecting statistical approaches can be problematic due to* the large number 
of analysis systems available. The micro array field is rapidly evolving, and the 
number of publications on methods is substantial (Hess, K.R. et al (2001) 

"MiCROARRAYS: HANDLING THE DELUGE OF DATA AND EXTRACTING RELIABLE 

i0 Information," Trends in Biotech. 19:463-468; Pan, W. (2002) "A Comparative 
Review Of Statistical Methods For Discovering Differentially 
Expressed Genes In Replicated Micro Array Experiments," Bioinformatics 
1 8:546-554; Nadon, R. et al (2002) "Statistical Issues With Microarra ys: 
Processing And Analysis," Trends in Genet 18:265-271). To date, there is no 

15 consensus approach to statistical analysis and thus array results are analyzed in a 
variety of different ways. However, at a minimum, there are basic methods that 
may be applied. Numerical management of the data permits removal of artifacts 
caused by low gene expression and low ratios (for brief review, see Mills, J.C. et 

dL (2001) "MiCROARRAYS AND BEYOND: COMPLETING THE JOURNEY FROM TISSUE 

20 To Cell," Nat Cell Biol. 3:175-178). Following data pre-processing and 

numerical management, a statistical approach must be chosen to determine gene 
significance (as an example, see Mutch, D.M. et al. (2001) "Microarray Data 
Analysis: A Practical Approach For Selecting Differentially Expressed 
Genes," Genome Biol. 2, preprint 0009). Finally, high-end computational analysis 

25 may be employed such as clustering, multidimensional scaling, or pattern 

identification, including neural networks and heuristic algorithms (Ellis, M. et al 
(200 1 ) "Development And Validation Of A Method For Using Breast Core 
Needle Biopsies For Gene Expression Micro Array Analyses," Clin. Cancer 
Res. 8:1 155-1 166). The goal of all of these efforts is accurate identification of 

30 differences in gene expression between the sample sets, and maximal use of the 
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informatipn toward a better understanding of the biological processes) under 
study. 

Independent Confirmation of Results There are two approaches to 
independent confirmation of micro array data; in silica analysis, and labbratory- 
5 based analysis. The in silica method utilizes a comparison of array results with 
information available in the literature, as well as public or private expression 
databases, and provides the opportunity to validate data without further 
experimentation. For example, multiple studies of prostate cancer profiling have 
been published (Luo, J. et al. (2001) "Human Prostate Cancer And Benign 

1 0 Prostatic Hyperplasia: Molecular Dissection By Gene Expression 

PROFILING," Cancer Res. 61:4683-4688; Welsh, J.B. et al. (2001) "ANALYSIS OF 
Gene Expression Identifies Candidate Markers And Pharmacological 
TARGETS IN PROSTATE CANCER," Cancer Res. 6 1 =5974-5978; LaTulippe, E. et al. 
(2002) "COMPREHENSIVE GENE EXPRESSION ANALYSIS OF PROSTATE CANCER 

1 5 REVEALS DISTINCT TRANSCRIPTIONAL PROGRAMS ASSOCIATED WITH METASTATIC 
DISEASE," Cancer Res. 62:4499-4506; Dhanasekaran, S.M. et al. (2001) 
"DELINEATION OF PROGNOSTIC BlOMARKERS IN PROSTATE CANCER," Nature 

412:822-826; Emmert-Buck, M.R et al. (2000) "Molecular Profiling Of 
Clinical Tissue Specimens: Feasibility And Applications," Am. J. of Pathol. 

20 1 56: 11 09-1 1 15; Magee, J.A. et al. (2001) "Expression Profiling Reveals 
Hepsin Overexpression In Prostate Cancer," Cancer Res. 61 .5692-5696; 
Stamey, T.A. et al. (2001 ) "Molecular Genetic Profiling Of Gleason Grade 
4/5 Prostate Cancers Compared To Benign Prostatic Hyperplasia," J. Urol. 
166:2171-2177; Ahram, M. et al. (2001) "PROTEOMIC ANALYSIS OF Human 

25 Prostate Cancer," Mol. Carcin. 33: 9-15; Singh, D. etal. (2002) "Gene 

Expression Correlates Of Clinical Prostate Cancer," Cancer Cell l :203- 
209; Svaren, J. et al. (2000) "EGR1 Target Genes In Prostate Carcinoma 
Cells Identified By Microarray Analysis," J. Bio!. Chem. 275:38524-3853 1 ; 
Chaib, H. et al. (2001) "Profiling And Verification Of Gene Expression 

30 Patterns In Normal And Malignant Human Prostate Tissues By cDNA 
Microarray Analysis," Neoplasia 3:43-52; Porkka, K. etal. (2002) 
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" AMPLIFICATION AND OVEREXPRESSION OF EtONGIN C GENE DISCOVERED IN 

Prostate Cancer By cDNA Microarrays," Lab. Invest. 82:629-637; Mousses, 
S. et al (2002) "Clinical Validation Of Candidate Genes Associated With 
Prostate Cancer Progression In The C WR22 Model System Using Tissue 
5 MICROARRAYS," Cancer Res. 62:1256-1260 (2002). In a meta-analysis of the data 
sets from four of these papers, several differentially expressed genes were found to 
be common to the majority of the studies, the serine protease hepsin as an example* 
(Rhodes, D. etal (2002) "Meta-Analysis Of Microarrays: Interstudy 
Validation Of Gene Expression Profiles Reveals Pathway Dysregulation 

1 0 In Prostate Cancer," Cancer Res. 62:4427-4433). Moreover, some of these 
genes, such as glutathione-S-transferase, have previously been identified as 
aberrantly expressed in prostate cancer in studies using methods other than 
microarrays (Canada, A.T. et al (1996) "Glutathione And Glutathione S- 
Transferase In Benign And Malignant Prostate Cell Lines And Prostate 

1 5 TISSUES," Biochem. Pharm. 5 1 :87-90). Agreement between array results from 
other groups, as well as with known expression information in the literature, 
validates die general performance of a system and provides confidence in the 
overall data, including the unique and novel discoveries made in a study. It is 
likely that the in silica approach to validation will become more useful when 

20 standardized methods for reporting array data, such as the MIAME format 
(Minimal Information About a Microarray Experiment) are uniformly applied 
(Brazma, A. et al. (2001 ) "Minimum Information About A Micro Array 
Experiment (MIAME)-Toward Standards For Micro Array Data," Nat 
Genet. 29:365-371). 

25 Laboratory-based data validation provides independent, experimental 

verification of gene expression levels, and typically begins with the same samples 
that were studied in the initial array experiment(s). The methodology employed 
varies depending upon the scientific question posed, but commonly used 
techniques involving mRNA include: 
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Semi-Quantitative RT-PCR and real-time RT-PCR (see, for example, 
Al Moustafa, A.E. et al. (2002) "IDENTIFICATION OF GENES ASSOCIATED WITH 

Head And Neck Carcinogenesis By cDNA Microanay Comparison Between 
Matched Primary Normal Epithelial And Squamous Carcinoma Cells," 
5 Oncogene 21 :2634-2640; Chaib, H. et al. (2001) "Profiling And Verification 
Of Gene Expression Patterns In Normal And Malignant Human Prostate 
TISSUES By CDNA MiCROARRAY Analysis," Neoplasia 3:43-52; Luo, J. el al: 
(2001) "Human Prostate Cancer And Benign Prostatic Hyperplasia: 
Molecular Dissection By Gene Expression Profiling," Cancer Res. 6 1 :4683- 

10 4688; Luo, J.H. et al. (2002) "Gene EXPRESSION ANALYSIS OF PROSTATE 

CANCER," MoI. Carcin. 33:2535; Bangur, C.S. etal. (2002) "IDENTIFICATION OF 
Genes Over-Expressed In Small Cell Lung Carcinoma Using Suppression 
Subtractive Hybridization And cDNA Microarray Expression Analysis," 
Oncogene 21:3814-3825; Jiang, Y. etal. (2002) "Discovery Of Differentially 

1 5 Expressed Genes In Human Breast Cancer Using Subtracted cDNA 

Libraries And cDNA Microarrays," Oncogene 21 :2270-2282; Alevizos, I. et 
al. (2001 ) "Oral Cancer In Vivo Gene Expression Profiling Assisted By 
Laser Capture Microdissection And Microarray analysis," Oncogene 
20:6196-6204; Scheidl, S.J. etal. (2002) "mRNA Expression Profiling Of 

20 Laser Microbeam Microdissected Cells From Slender Embryonic 

STRUCTURES," Amer. J. of Pathol. 160:801-813; Svaren, J. etal. (2000) "EGR1 
Target Genes In Prostate Carcinoma Cells Identified By Microarray 
Analysis," J. Biol. Chem. 275:38524-3853 1 ); 

Northern Analysis (see, for example, Taniguchi, M. et al. (2001) 
25 "Quantitative Assessment Of DNA Microarrays-Comparison With 
Northern Blot Analyses," Genomics 7 1 :34-39; Clark, J. etal. (2002) 
"Identification Of Amplified And Expressed Genes In Breast Cancer By 
Comparative Hybridization Onto Micro Arrays Of Randomly Selected 
CDNA Clones," Genes, Chrom. and Cancer 34:104-1 14; Amundson, S.A. et al. 
30 ( 1 999) "Fluorescent cDNA Microarray Hybridization Reveals 

Complexity And Heterogeneity Of Cellular Genotoxic Responses," 
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Oncogene 18:3666-3672; Chaib, H. etal. (2001) "PROFILING AND VERIFICATION 
Of Gene Expression Patterns In Normal And Malignant Human Prostate 
Tissues By cDNA Microarray Analysis," Neoplasia 3:43-52; Dhanasekaran, 
S.M. et al. (2001) "Delineation Of Prognostic Biomarkers In Prostate 
5 CANCER," Nature 412:822-826); 

Ribonuclease Protection Assay (Taniguchi, M. et al. (2001) 
"Quantitative assessment Of DNA Microarrays-Comparison With 
Northern Blot Analyses," Genomics 71:34-39; Hodge, D.L. et al. (2002) "IL-2 
And IH 2 Alter NK Cell Responsiveness To IFN-Gammainducible Protein 
10 1 0 By Down-Regulating CXCR3 Expression," J. Immunol. 168:60906098; 

Saban, M.R. et al. (2001) "TIME COURSE OF LPS-lNDUCED GENE EXPRESSION IN A 
Mouse Model Of Genitourinary Inflammation," Physiol. Genomics 5:147- 
160); and 

In Situ Hybridization (ISH) Or Immunobistochemistry (IHC) Using 
15 Tissue Microarrays (Bonaventure, P. et al. (2002) "NUCLEI AND SUBNUCLEI 
• GENE EXPRESSION PROFILING IN MAMMALIAN BRAIN," Brain Res. 943:38-47; Qi, 
Z.Y. et al (2002) "Isolation Of Novel Differentially expressed. Genes 
related To Human Glioma Using cDNA Migro Array And 
Characterizations Of Two Novel Full-Length Genes," J. Neurooncol. 
20 56: 1 97-208; Porkka, K. et al. (2002) "AMPLIFICATION AND OvEREXPRESSION OF 
Elongin C Gene Discovered In Prostate Cancer By cDN A Microarrays," 
Lab. Invest. 82:629-637; Mousses, S. et al. (2002) "Clinical Validation Of 
Candidate Genes Associated With prostate Cancer Progression InThe 
• CWR22 model System Using Tissue Microarrays," Cancer Res. 62:1256- 
25 1 260). Commonly used techniques involving Proteins include immunoblots and 
immunohistochemistry (IHC) (A l Moustafa, A.E. et al. (2002) "Identification 
Of Genes Associated With Head And Neck Carcinogenesis By cDNA 
Microanay Comparison Between Matched Primary Normal Epithelial 
And SQUAMOUS Carcinoma Cells," Oncogene 21 :263 4-2640; Burton, G.R. et 
30 (2002) " Microarray Analysis Of Gene Expression During Early 
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Adipocyte Differentiation," Gene 293:21-3 1 ) and/or BHC via tissue 
microarrays (Dhanasekaran, S.M. et al (2001) "Delineation Of Prognostic 
Biomarkers In Prostate Cancer," Nature 412:822-826; Mousses, S. et al 
(2002) "Clinical Validation Of Candidate Genes Associated With 
5 Prostate Cancer Progression In The C WR22 Model System Using Tissue 
Microarrays," Cancer Res. 62:1256-1260). 

Commonly used methods involving proteins include: 

Immunoblot Al Moustafa, A.E. et al (2002) "Identification Of Genes 
Associated With Head And Neck Carcinogenesis By cDNA Microanay 
1 0 Comparison Between Matched Primary Normal Epithelial And Squamous 
Carcinoma Cells," Oncogene 21 :2634-2640; Burton, G.R. et al (2002) " 
Microarray Analysis Of Gene Expression During Early Adipocyte 
Differentiation," Gene 293:21-31; and 

Immunohistochemistry (BttQ and/or IHC via tissue microarrays 
1 5 Dhanasekaran, S.M. et al (200 1 ) "Delineation Of Prognostic Biomarkers In 
Prostate Cancer," Nature 412:822-826; Mousses, S. et al (2002) "Clinical 
Validation Of Candidate Genes Associated With Prostate Cancer 
Progression In The CWR22 Model System Using Tissue Microarrays," 
Cancer Res. 62:1256-1260 (2002). 

20 See, in particular, Luo, J. et aL (2001) "Human Prostate Cancer And 

Benign Prostatic Hyperplasia: Molecular Dissection By Gene Expression 
Profiling," Cancer Res. 61 :4683-4688; Dhanasekaran, S.M. et al (2001) 
"Delineation Of Prognostic Biomarkers In Prostate Cancer," Nature 
412:822-826; Stamey, T.A. et al (200 1 ) "MOLECULAR GENETIC PROFILING OF 

25 Gleason Grade 4/5 Prostate Cancers Compared To Benign Prostatic 

Hyperplasia,"! Urol. 166:2171-2177; Singh, D. etal. (2002) "Gene Expression 
Correlates Of Clinical Prostate Cancer," Cancer Cell 1 :203-209; Al 
Moustafa, A.E. et al (2002) "Identification Of Genes Associated With Head 
And neck Carcinogenesis By cDN A Microanay Comparison Between 
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Matched primary Normal Epithelial And Squamous Carcinoma Cells," 
Oncogene 21 :2634-2640; Bangur, C.S. et al. (2002) "Identification Of Genes 
Over-Expressed In Small Cell Lung Carcinoma Using Suppression 
SuBTRAcnvE Hybridization And cdna Microarray Expression Analysis," 
5 Oncogene 21:3814-3825; Alevizos, I. et al. (2001) "Oral Cancer In Vivo Gene 
Expression Profiling Assisted By Laser Capture Microdissection And 
Microarray Analysis," Oncogene 20:6196-6204; Luo, J.H. et al. (2002) "Gene 
Expression analysis Of Prostate Cancer," Mol. Carcin. 33:2535; Jiang, Y. et 
al. (2002) "Discovery Of Differentially Expressed Genes In Human Breast 

1 0 cancer Using Subtracted cDNA Libraries And cDNA Microarrays," 
Oncogene 21 :2270-2282; Scheidl, S.J. et al. (2002) "MRNA EXPRESSION 
Profiling Of Laser Microbeam Microdissected Cells From Slender 
- Embryonic Structures," Amer. J. of Pathol. 160:801-81 3; Amundson, S A. et 
al. (1999) "Fluorescent cDNA Microarray Hybridization Reveals 

1 5 Complexity And Heterogeneity Of Cellular Genotoxic Responses," 
Oncogene 1 8:3666-3672; Clark, J. et al. (2002) "Identification Of Amplified 
And Expressed Genes In Breast Cancer by Comparative Hybridization 
Onto Micro Arrays Of Randomly Selected cDNA Clones," Genes, Chrom. 
and Cancer 34:104-1 14; Rajeevan, M.S. etal. (2001) "Validation OfArray- 

20 based Gene Expression Profiles Bv Real-Time (Kinetic) RT-PCR," J. of 
Molec. Diagnos. 3:26-31; Ross, D.T. et al. (2000) "Systematic Variation In 
Gene Expression Patterns In Human Cancer Cell Lines," Nat Genet 
24:227-235; Taniguchi, M. et al. (2001) "QUANTITATIVE ASSESSMENT OF DNA 
MlCROARRAYS-COMPARISON WITH NORTHERN BLOT ANALYSES," Genomics 

25 71 :34-39; Sgroi, D.C. et al. (l 999) "In Vivo Gene Expression Profile Analysis 
Of Human Breast Cancer Progression," Cancer Res. 59:5656-566 1 ; Vu, H.L. 

et al (2000) "A METHOD FOR QUANTIFICATION OF ABSOLUTE AMOUNTS OF 
NUCLEIC ACIDS BY (RT)-PCR AND A NEW MATHEMATICAL MODEL FOR DATA 

Analysis," Nucleic Acids Res. 28:E18; Nadu, M. etal. (2001) "Molecular 
30 Characteristics Of Non-Small Cell Lung Cancer," Proc. Natl. Acad. Sci. 
USA 98:1 5203-15208). Real-time RT-PCR is the choice of many for 
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quantitatively measuring specific mRNAs as, once established, the method is rapid, 
relatively inexpensive and requires minimal starting template (Rajeevan, M.S. et 
al. (2001 ) "Validation Of Arraybased Gene Expression Profiles By Real- 
Time (Kinetic) RT-PCR," J. of Molec. Diagnos. 3:26-31; Walker, N. (2002) "A 
5 Technique Whose Time Has Come," Science 296:557-558. However, it should 
be noted that real-time RT-PCR requires a significant amount of up-front effort to 
optimize amplification conditions, and there are potential pitfalls of the method 
that should be carefully monitored in order to obtain optimal results. 

Comparisons of array-based results with northern blots reveal good general 
1 0 agreement between the methods, although arrays were less sensitive in measuring a 
subset of genes (Taniguchi, M. et al. (200 1) "Quantitative Assessment Of DNA 

MlCROARRAYS-COMPARISON WITH NORTHERN BLOT ANALYSES," Genomics 

71 :34-39). Rajeevan and co-workers used a modified real-time R T - PCR method 

to evaluate array data (Rajeevan, M.S. et al. (20oi) "Validation Of Arraybased 

1 5 Gene Expression Profiles By Real-Time (Kinetic) RT-PCR," J. of Molec. 

Diagnos. 3:26-31). These investigators also found that the majority of array results 
were qualitatively accurate; however, consistent validation was not achieved for 
genes showing less than a four-fold difference on the array. Moreover, for many 
genes examined, there were significant quantitative differences between array- and 

20 RT-PCR-based data. Overall, a review of the literature suggests that several 
technical questions regarding methods used for validation have yet to be 
thoroughly and rigorously addressed by the array community. 

The selection of the gene set for follow-up analysis in the laboratory 
depends on the aim(s) of the study, but is influenced by factors such as the relative 

25 difference in expression among the samples, biological function, abundance levels, 
and availability of appropriate reagents (probes and antibodies). Investigators often 
choose the genes with the highest differential expression ratios, as these are most 
likely to be validated. However, since quantitative information from arrays may be 
imprecise for transcripts showing small expression differences, this strategy could 

30 overlook significant genes of interest. In the future, it is likely that development of 
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more robust and quantitative array platforms will increase the confidence that 
genes exhibiting relatively small expression difference among samples are accurate 
and thus worth further investigation. In the near term, genes on an array that show 
significant differences in expression may serve as important clues and point 
5 ' investigators towards biochemical pathways whose members should be studied in 
detail using more quantitative methods. 

In addition to validating array results at the mRNA level, it is equally 
desirable to evaluate expression levels of the corresponding protein products. At 
present, the frequency with which protein expression equates with transcript levels 

10 as measured on arrays is not clear. A poll of users of tissue micro arrays produced 
by the National Cancer Institute fwww.cancer.gov/tarp) indicates that protein 
expression changes correlate with mRNA alterations less than 50% of the time. 
However, there are several caveats to consider that may be responsible for this 
discrepancy, including; the sensitivity and dynamic range of the methodology 

1 5 employed, the specificity of the antibody probe, the absolute difference in protein 
levels as compared to mRlMA levels, and the abundance level of the protein being 
analyzed. Moreover, as indicated above, investigators often bias their selection of 
mRNAs for follow-up analysis towards those that show the largest fold-change. 
This may not necessarily translate to a similar difference at the protein level, 

20 particularly since protein function in the cell is affected by several parameters 
besides abundance. 

Universality of Results Once array data have been analyzed and 
independently verified, either in silica or in the laboratory, investigators can 
determine if the expression profiles are a universal feature of the biological 
25 phenomenon under study. This question can be addressed by evaluating a critical 
gene set in a larger and more extensive study group, and can be performed either in 
. silica or in the laboratory. For example, an investigator may identify a particular 
expression pattern in a breast cancer cell line after treatment with a drug. 
Subsequent experiments to evaluate the expression profiles of additional breast 
30 cancer cell lines are then desirable to determine if this finding represents a general 
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feature of breast tumor cells. If so, a logical next step would be to examine the 
expression profiles in clinical samples from patients being treated with the drug. 
These experiments are important to validate or invalidate the data generated with 
an in vitro model system. Moreover, correlation of the gene expression pattern may 
5 also be made with respect to additional clinical parameters, such as the frequency 
with which patients show the identified profile, patient age, disease stage, and 
tumor histopathology (see, Singh, D. et al (2002) "GENE EXPRESSION 
Correlates Of Clinical Prostate Cancer," Cancer Cell l :203-209) 

Tissue micro arrays (TMA) are an excellent approach for validation of 

10 array data in a large set of human or animal species (Dhanasekaran, S.M. et aL 
(2001) "Delineation Of Prognostic Biomarkers In Prostate Cancer," 
Nature 4 1 2:822-826; Bangur, C.S. et al (2002) IDENTIFICATION OF GENES Over- 
Expressed In Small Cell Lung Carcinoma Using Suppression Subtractive 
Hybridization And cDNA Microarray Expression Analysis," Oncogene 

15 21:3814-3825; Kononen, J. etal (1998) 'Tissue Micro Arrays For High- 
Throughput Molecular Profiling Of Tumor Specimens," Nat Med 4:844- 
847; Kallioniemi, O.P. et al (2001) "Tissue Microarray Technology For 
High-Throughput Molecular Profiling Of Cancer," Human Molec. Genet 
1 0:657-662; Hoos, A. et al 'Tissue Micro Array Profiling Of Cancer 

20 Specimens And Cell Lines: Opportunities And Limitations," Lab. Invest. 
81:1331-1338; Fejzo, M. et al (2001) "Frozen Tumor Tissue Micro Array 
Technology For Analysis Of Tumor RN A, DNA, And Proteins," Am. J. 
Pathol. 159:1645-1650; Bubendorf, L.etal (1999) "HORMONE THERAPY FAILURE 
In Human Prostate Cancer: Analysis By Complementary DNA And Tissue 

25 Microarrays," J. Natl. Cane. Inst 91 -.17581764; Battifora, H. (1986) "The 
Multitumor (Sausage) Tissue Block: Novel method For 
Immunohistochemical Antibody Testing," Lab. Invest. 55:244-248). A TMA 
is a slide containing dozens to hundreds of pre-defined microscopic sections of 
tissue, making it feasible for an investigator to measure DNA, mRNA or protein 

30 expression in a large number of samples, providing enough statistical power for 
meaningful analysis, A particular advantage of the method is that both same- 
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sample and new-sample analysis can be performed using the same TMA. A related 
approach is the "tissue lysate array," where lysates from cell populations collected 
by laser capture microdissection (LCM) are arrayed on a nylon coated slide 
(Emmert-Buck, M.R. et al (1996) "Laser Capture Microdissection," Science 
274:998-1001; Bonner, R.F. etaL (1997) "Laser Capture Microdissection: 
Molecular Analysis Of Tissue," Science 278: 148 1 -1 483; Paweletz, CP. et al. 
(200 1 ) "Reverse Phase Protein Micro Arrays Which Capture Disease 
Progression Show Activation Of Pro-Survival Pathways At The Cancer 
Invasion Front," Oncogene 20:1981-1989). The most common methodology 
applied to TMAs is IHC, but increasingly ISH is being used, although sensitivity 
remains a challenging issue for ISH and frequently requires the use of radioactive 
probes for detection. Other techniques applicable to TMAs include fluorescence 
in-situ hybridization (FISH) for chrqmosomal copy number, and 
immunofluorescence in combination with confocal microscopy. 

Like any experimental platform, TMAs are not without limitation. The 
most significant drawbacks involve sensitivity, lack of quantitation, and potential 
selection bias of the tissue samples. Both IHC and ISH are qualitative/semi- 
quantitative. With several hundred tissue specimens per TMA slide, the 
requirement for a pathologist or experienced investigator to view and score each 
sample is problematic. Robust methods to analyze TMA slides in an automated 
fashion are facilitated by the development of a TMA database exchange 
(http^wv.lw.nathinfo.comAih/h na fi, n „i Selection bias is also a potential 
problem for TMAs due to the small size of the tissue core that is removed from the 
donor specimen. In some instances, the issue is of little concern as the expression 
levels of certain mRNAs and proteins are relatively uniform throughout the tissue. 
However, tissues are complex, multicellular entities that can contain significant 
intra-specimen molecular heterogeneity, particularly with respect to evolving 
disease processes. Therefore, a single small punch of tissue may not necessarily be 
representative of the overall state of the organ. Moreover, many small, but 
critically important structures (normal and disease-associated), are often sectioned 
through within the first few slides generated from a TMA block, and are thus 
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unavailable for further study. Layered Expression Scanning, as described above, 
overcomes many problems related to sample bias. 

Issues of cross-hybridization and RNA amplification related to validation 
of array data merit further discussion: 

5 Non-specific and Cross Hybridization Two phenomena of cDNA 

microarrays related to target-probe hybridization have been observed (Emmert- 
Buck, M.R et al (2000) "Molecular Profiling Of Clinical Tissue Specimens: 
Feasibility And Applications," Am. J. of Pathol. 156:1 109-1 1 15). With many 
array systems, a significant number of arrayed DNAs produce "non-specific" 

10 background signals during experiments, mediated by repetitive elements, polyA 
tails, common motifs, or other unknown sequence-dependent regions of the DNA. 
When co-hybridized with two cDNA samples labeled with different fluorophores, 
these arrayed DNAs produce (often strong) signals that are interpreted as "equally 
expressed" in the biological samples under study. There are several important 

15 implications of this occurrence. For example, it can mislead investigators with 
respect to how many genes are actually being measured in an experiment. This is 
particularly important when assessing the effectiveness of an amplification 
scheme. Tf the criterion used to assess amplification efficiency is correlation of 
array results between amplified and non-amplified samples, and a large proportion 

20 of the similarity is due to non-specific hybridization, then the correlation 
coefficient between the ± amplification samples will be artificially high. 
Moreover, if this phenomenon is not factored into validation studies, investigators 
may find a significant discrepancy between array data and the expression results 
generated in subsequent follow-up efforts. 

25 Non-specific signals on an array can also mislead investigators who are 

interested in the overall (in)activation state of a particular biochemical pathway. 
When comparing two samples, it is not only important to know which genes in a 
pathway are differentially expressed, it is also important to know which genes are 
not expressed. Erroneous data will compromise efforts to understand how each of 
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the members of a pathway is regulated with respect to a particular cellular process. 
Finally, correlation of mRNA levels measured on an array with corresponding 
proteins will appear artificially low if, in fact, a substantial fraction of the 
"expressed genes" are due to experimental artifact. This can have important 
5 implications for eiTorts in which both protein expression — and lack of expression - 
- are important. An example would be a search of candidate proteins for a tumor 
vaccine. Investigators often screen mRNA databases generated from arrays as an 
initial guide, with the intent to find proteins that are present in a target disease, but 
are not in normal cells and organs throughout the body. In this instance, non- 
10 specific array signals will make it appear that a significant number of genes are 
expressed widely in tissues when, in reality, their profile may be much more 
limited. 

The second observed phenomenon is that a subset of target cDNAs will 
hybridize strongly to both their intended DNA probe, as well as other DNA probes 

15 on an array, ranging from a few to several dozen. In some cases, this can be 

understood based on sequence homologies among gene family members; however, 
it has been empirically observed that certain cDNAs will hybridize to arrayed 
DNA that do not share an easily identifiable common sequence. When these 
cDNAs are present at high copy number in a sample, they can produce artifactual 

20 data based on cross-hybridization. This phenomenon impacts upon investigators 
performing cluster analysis (i.e., examining gene sets that appear to share similar 
expression patterns and co-segregate during experiments). It is important that ' 
subsequent validation studies are designed with this possibility in mind. For 
example, follow-up studies using 1SH analysis of TMAs could be similarly 

25 compromised by target-probe cross-hybridization. Thus, array cluster results 
should be verified using a methodology that does not rely on hybridization, or 
alternatively, using a northern blot where hybridization specificity can be evaluated 
based on transcript size. Additionally, to further validate their data, investigators 
can compare and contrast their array-based results in silico with expression 

30 information from sequencing-based profiling efforts that do not rely on 
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hybridization methods, for example, expressed sequence tag (EST) or serial 
analysis of gene expression (SAGE) data sets. 

Amplification of mRNA Samples One of the goals of the NCI's Cancer 
Genome Anatomy Project (CGAP) is to assess the feasibility of producing a 
5 complex transcriptome from small numbers of microdissected cells 

(http://cgap.nci.nih.gOv/l 1 2.47.Afurtheraimwas to evaluate the effects of PCR 
amplification on the mRNA population. Overall, it was observed that PCR induced 
a bias in transcript levels to a varying degree, depending on the amplification 
scheme, cycle number, and primer set However, it was also found that 

1 0 "intentionally biasing" the transcriptome with PCR was useful for discovering 
novel expression differences between cell populations. The newly discovered 
genes were typically expressed at low abundance levels and were not identified in 
experiments using non-biased cDNA. Thus, as a general strategy, it may be 
necessary to include a selective amplification step in some array experiments such 

1 5 that low-abundance transcripts can be studied. However, if this strategy is utilized, 
investigators must then be aware that subsequent validation experiments will be 
more challenging, and may also require PCR-based approaches. In other words, the 
identified transcripts of interest may be difficult to measure using northern blots or 
other techniques that do not incorporate an amplification step. Moreover, 

20 evaluation of the corresponding protein products may be particularly demanding if 
they are similarly expressed at low levels in the biological samples under study. 

Even though mRNA amplification may induce some bias in the 
transcriptome, several investigators have successfully used this approach to 
perform assay experiments (Sgroi, D.C. et al (1999) "In Vivo Gene Expression 

25 Profile Analysis Of Human Breast Cancer Progression," Cancer Res. 
59:5656-5661; Cole, K.A., etal. (1999) "THE GENETICS Of CANCER- A 3D 
Model," Nat. Genet. 21:38-41; Luo,L. etal. (1999) "Gene Expression Profiles 
Of Laser-Captured Adjacent Neuronal Subtypes," Nat Med. 5:1 17-122; 
Luzzi, V. et al (2001) "Expression Profiling Of Ductal Carcinoma In Situ 

30 By Laser Capture Microdissection And High-Density Oligonucleotide 
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Arrays," Amer. L. Pathol. 158:2005-2010; Leethanakul, C. etal. (2000) 
"Distinct Pattern Of Expression Of Differentiation And Growth-Related 
Genes In Squamous Cell Carcinomas Of The Head And Neck Revealed By 
The Use Of Laser Capture Microdissection And cDNA Arrays," Oncogene 
1 9:3220-3224; Van Gelder, R. et al. (1990) "Amplified RNA Synthesized From 
Limited Quantities Of Heterogeneous cDN A," Proc. Natl. Acad. Set USA 
87:1 663-1667). Optimizing the method such that bias is consistent and 
reproducible is important, and allows relative comparisons of transcript levels 
between similarly prepared mRNA samples. However, one of the frequent 
problems that is observed is the inefficient priming of particular mRNA(s) early in 
the amplification process. This appears to occur randomly to individual 
transcripts. One approach to overcome this problem is to perform multiple (e.g., 3) 
independent amplifications of the starting mRNA, and subsequently pool the 
cDNA together for array analysis. Since drop-out occurs indiscriminately, it is 
unlikely that the same transcript will fail to amplify in more than one reaction. This 
approach is a simple means to reduce artifact and increases the percentage of 
expression differences that validate in follow-up studies. 

The use of micro array and other global profiling technologies has lead to a 
significant number of exciting new biological discoveries, and important 

20 correlation between gene expression patterns and disease states. Nonetheless, it is 
important that investigators continue to optimize array methodologies, and develop 
new approaches to producing accurate and experimentally valid data. The 
techniques of "Expression Microdissection" (also referred to herein as Labeling- 
based ANalysis of Cells or "LANC"), and Layered Expression Scanning (LES) 

25 address this need. Both methods are conceptually simple and are intended to 
increase the throughput rate of experimentation, while decreasing the time and 
effort required of the researcher. 

LES allows for multiplex measurement of transcripts or proteins in a 
variety of two-dimensional life science platforms (Englert, C.R. et al. (2000) 
30 "Layered Expression Scanning: Rapid Molecular Profiling Of Tumor 
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Samples," Cancer Res. 60: 1 526- 1 530. LES can be used to analyze gels, tissue 
sections, and cell populations recovered by LCM. The method is intended to 
facilitate the transition from global profiling efforts towards defined studies of 
biochemical pathways that are identified as important in array- and proteomic- 
5 based efforts. Two applications of LES technology are being used to validate 
expression data from the study of prostate cancer (Cole, K.A., et al. (1999) 'The 
Genetics Of Cancer- A 3D Model," Nat. Genet 21 :38-41). The replicate gel 
approach for multiplex northern gels and immunoblots is being utilized for follow- 
up analysis of tumor-related mRNA and protein alterations, respectively. These 
10 blots permit robust quantitative measurement of expression levels, and include 
verification of probe specificity based on the molecular size of transcripts and 
proteins. In parallel, the mRNA and protein levels in whole-mount prostatectomy 
specimens are being analyzed using LES to validate and further characterize 
expression levels of genes identified in mRNA and protein profiling experiments 
1 5 (Emmert-Buck, M.R et al. (2000) "Molecular Profiling Of Clinical Tissue 
Specimens: Feasibility And Applications," Am. J. of Pathol. 156:1109-1115; 
Ahram, M. et al. (2001) "Proteomic Analysis Of Human Prostate Cancer," 
Mol. Carcin. 33: 9-15; Paweletz, CP. etal. (2001) "Reverse Phase Protein 
Micro Arrays Which Capture Disease Progression Show Activation Of 
20 Pro-Survival Pathways At The Cancer Invasion Front," Oncogene 
20: 1981-1989; Cole, K.A., et al. (1999) "The Genetics Of Cancer- A 3D 
Model," Nat Genet. 2 1 :38-4 1 ; Emmert-Buck, M.R et al. (2000) "An Approach 
To Proteomic Analysis Of Human Tumors," Mol. Carcin. 27:158-165; 
Ornstein, D.K. et al. (2000) "Proteomic Analysis Oflaser Capture 
25 Microdissected Prostate Cancer And In Vitro Cell Lines," Electrophoresis 
21 :2235-2242; Carlisle, A. et al. (2000) "Development Of A Prostate cDNA 
Micro Array And Statistical Gene expression Analysis Package," Mol. 
Carcin. 27:1-1 1; Strausberg, RL. etal. (2000) "The Cancer Genome Anatomy 
Proiect: Building An Annotated Gene Index," Trends in Genet. 1 6: 1 03-1 06; 
30 Paweletz, CP. et al. (2002) "Loss Of Annexin 1 Correlates With Early 
Onset Of Tumorigenesis In Esophageal And Prostate Carcinoma," Cancer 
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Res. 60:6293-6297; Kang, j. et al. (2002) "Dysregulation Of Annexin I 
Protein Express.on in High-GradeProstatic Intraepithelial Neoplasia 
And Prostate Cancer," Clin. Cancer Res. 8: 1 17-123). This permits candidate 
prostate cancer-associated changes that were initially identified in a small set of 
cases to be (in)validated in a large number of specimens. Moreover, expression 
profiles can be assessed in all cell populations present (various tumor grades, pre- 
malignant lesions, normal epithelium, and lymphocytes associated with tumors 
cells), facilitating a more thorough investigation of the identified gene set in the 
disease process. 

"Expression Microdissection" (LANC) is a front-end method for 
performing "virtual microdissection" of a specific cell type within a heterogeneous 
environment, based on expression of a target molecule. The approach is intended 
to improve the accuracy of global expression measurements, such that the data 
produced are robust and valid. Expression Microdissection (LANC) is performed 
using a specially designed polymer tethered to an antibody (or nucleic acid) for cell 
targeting, and to an enzyme for subsequent labeling of nucleic acids or proteins. 
The polymer is tethered to both a secondary antibody and a labeling enzyme. The 
polymer complex is hybridized to a primary antibody that is used to target a cell 
population of interest, followed by an enzymatic reaction that specifically labels 
the nucleic acids or proteins in the targeted cells. A portion of, or more preferably, 
the entire, tissue section is then scraped into a tube containing lysis buffer, and the 
labeled biomolecules analyzed by an appropriate detection method. The 
Expression Microdissection (LANC) procedure may be modified so as to employ it 
in conjunction with LCM (for example, by replacing the "labeling enzyme" with a 
dye (or dye-generating enzyme) that can activate the capture film). This approach 
may be useful when an investigator needs to physically procure biomolecules from 
a relatively large number of targeted cells. Once the polymer has been hybridized 
to the cell type of interest, the enzyme catalyzes a reaction that adds a label 
specifically to the biomolecules of the targeted cells. For example, reverse 
transcriptase can be used to create fluorescently labeled cDNA for arrays, or 
lactoperoxidase can be used to add l2S I to proteins. After the enzymatic reaction, 
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the entire biological sample is placed into a tube for analysis, utilizing a detection 
method in which only the labeled transcriptome or proteome is visualized. For 
experiments using targeting of cellular proteins, the polymer is linked to a 
secondary antibody, thus allowing the investigator to employ any primary antibody 
5 of interest in their experiment LES, Expression Microdissection (LANC), LCM, 
and immuno-LCM can be important and complementary tools that will assist 
investigators in phenotype- and expression-based profiling studies of cell 
populations (Emmert-Buck, M.R. et al (1996) "Laser Capture 
Microdissection," Science 274:998-1001; Bonner, R.F. et al (1997) "Laser 
1 0 Capture Microdissection: Molecular Analysis Of Tissue," Science 
278:1481-1483; Fend, F. et al (1999) "IMMUNO-LCM: LASER CAPTURE 
Microdissection Of Immunostained Frozen Sections For mRNA Analysis," 
Amer. J, Pathol. 154:61-66). 

All publications and patents mentioned in this specification are herein 
15 incorporated by reference to the same extent as if each individual publication or 
patent application was specifically and individually indicated to be incorporated by 
reference. 

While the invention has been described in connection with specific 
embodiments thereof, it will be understood that it is capable of further 
20 modifications and this application is intended to cover any variations, uses, or 
adaptations of the invention following, in general, the principles of the invention 
and including such departures from the present disclosure as come within known 
or customary practice within the art to which the invention pertains and as may be 
applied to the essential features hereinbefore set forth. 
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What Is Claimed Is: 

1 . A method for analyzing the transcriptome of a cellular sample comprising 
analyzing two or more molecular, species present in a 2-dimensionaI array 
of said cellular sample, wherein said method comprises treating said 2- 
dimensional array with an External Movement Inhibitor device having 
multiple discrete partitions, so as to sequester molecules present in said 
array into one or more discrete regions, wherein said treatment preserves 
the positional relationship of the molecules of said 2-dimensional array, and 
permits a determination of the locations) in said cellular sample in which 
said molecular species are present. 

2. The method of claim 1 , wherein said cellular sample is a cellular sample 
obtained from a mammal. 

3. The method of claim 2, wherein said mammal is a human. 

4. The method of claim 1 , wherein said cellular sample is a tissue sample. 
The method of claim 4, wherein said tissue sample is a biopsy. 



6. The method of claim 1, wherein said molecular species are nucleic acid 
molecules. 

7. The method of claim 6, wherein said method additionally comprises 
incubating the sequestered nucleic acid molecules of two or more regions 

20 under conditions sufficient to permit the manipulation of one or more 

preselected nucleic acid molecules if present at said regions, while 
preserving the positional relationship of said molecules relative to other 
molecules of said 2-dimensional array. 

8 The method of claim 7, wherein said method additionally comprises 
25 • transferring said manipulated nucleic acid species to two or more 
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membranes, said membranes being differentially treated to enable the 
determination of the location(s) of manipulated nucleic acid species. 

9. The method of claim 7, wherein one or more of said preselected nucleic 
acid mo!ecuIe(s) are diagnostic of a disease state. 

5 10. The method of claim 7, wherein said manipulation is selected from the 
group consisting of nucleic acid amplification, reverse transcription, 
labeling, cloning, and the assaying of a biomolecule. 

11. The method of claim 7, wherein said method comprises incubating the 
sequestered nucleic acid molecules of all of said regions under conditions 

1 0 sufficient to permit the manipulation of said one or more preselected 

nucleic acid molecules. * 

12. The method of claim 1 1, wherein one or more of said preselected nucleic 
acid molecule(s) are diagnostic of a disease state. 

13. The method of claim 1 1, wherein said manipulation is selected from the 
1 5 group consisting of nucleic acid amplification, reverse transcription, 

labeling, cloning, and the assaying of a biomolecule. 

14. The method of claim 6, wherein said cellular sample is an extract of a cell, 
and said 2-dimensionaI array is a gel or membrane that arrays said nucleic 
acid molecules. 

20 15. The method of claim 1 4, wherein said method additionally comprises 

incubating the sequestered nucleic acid molecules of two or more regions 
under conditions sufficient to permit the manipulation of one or more 
preselected nucleic acid molecules if present at said regions, while 
preserving the positional relationship of said molecules relative to other 

25 molecules of said 2-dimensional array. 
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The method of claim 15, wherein one or more of said preselected nucleic 
acid moIecule(s) are diagnostic of a disease state. 

The method of claim 15, wherein said manipulation is selected from the 
group consisting of nucleic acid amplification, reverse transcription, 
labeling, cloning, and the assaying of a biomolecule. 

The method of claim 15, wherein said method additionally comprises 
transferring said manipulated nucleic acid species to two or more 
membranes, said membranes being differentially treated to enable the 
determination of the location(s) of manipulated nucleic acid species. 

The method of claim 15, wherein said method comprises incubating the 
sequestered nucleic acid molecules of all of said regions under conditions 
sufficient to permit the manipulation of said one or more preselected 
nucleic acid molecules. 

The method of claim 1 9, wherein one or more of said preselected nucleic 
acid molecule(s) are diagnostic of a disease state. 

The method of claim 19, wherein said manipulation is selected from the 
group consisting of nucleic acid amplification, reverse transcription, 
labeling, cloning, and the assaying of a biomolecule. 

The method of claim 1, wherein said molecular species are protein 
molecules. 

The method of claim 22, wherein said cellular sample is an extract of a cell, 
and said 2-dimensional array is a gel or membrane that arrays said 
molecules of said extract 



The method of claim 22, wherein one or more of said protein molecuie(s) 
are diagnostic of a disease state. 
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Abstract of the Invention: 

This invention relates to methods and apparati for performing multiple 
simultaneous manipulations of biomolecules in a two-dimensional array, such as a 
gel, membrane, tissue biopsy, etc. Such manipulations particularly include assays 
and nucleic acid amplification protocols. 
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